14:02
2026-07-01
dmodel.ai
ai-safety
Discovering Concept-Editing Algorithms with LLM Agents
Researchers tasked LLM agents with inventing concept-erasure algorithms that outperform existing methods like LEACE and QLEACE. The best agent-discovered algorithm reduced nonlinear probe accuracy fro…